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Abstract 

This article examines arbitrage investment in a mispriced asset when 
the mispricing follows the Ornstein-Uhlenbeck process and a credit-constrained 
investor maximizes a generalization of the Kelly criterion. The optimal 
differentiable and threshold policies are derived. The optimal differen- 
tiable policy is linear with respect to mispricing and risk-free in the long 
run. The optimal threshold policy calls for investing immediately when 
the mispricing is greater than zero with the investment amount inversely 
proportional to the risk aversion parameter. The investment is risky even 
in the long run. The results are consistent with the belief that credit- 
constrained arbitrageurs should be risk-neutral if they are to engage in 
convergence trading. 



Myron [Scholes] once told me they are sucking up nickels from 
all over the world. But because they are so leveraged that amounts 
to a lot of money. 

Merton Miller about the essence of arbitrage. 

1 Introduction 

Arbitrageurs are people who detect inconsistencies in asset prices and invest in 
them hoping that the inconsistencies will be ehminated. The waiting time is 
often uncertain and since the arbitrageur depends on the willingness of other 
people to lend him money, the irrationality of creditors may lead to great de- 
bacles long before prices converge to consistent values. The notorious story of 
the arbitrage fund LTCM that lost 90 percent of its value on "riskless" deals 
illustrates the importance of credit constraints. So what policy should the ar- 
bitrageur pursue when creditors impose borrowing constraints? In particular, 
can the arbitrageur allocate the available funds in such a way as to eliminate 
all the long-run risk? 

If this risk elimination is possible, the mispricings should be equally attrac- 
tive to risk-averse as well as risk-neutral investors. However, a popular view 
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asserts that arbitrageurs, unlike other investors, should be risk-neutral. Is there 
any ground for this belief? The present article provides a justification by solving 
for arbitrageurs' optimal policies under several types of constraints and showing 
that under some of them the long-run risk cannot be eliminated. Risk-averse 
investors that face those constraints are not interested in small mispricings. 

The paper investigates two classes of constraints that lead to strikingly dif- 
ferent results. Under constraints from the first class, the arbitrageur can only 
change the leverage slowly. In practice, borrowing additional funds takes time: 
The arbitrageur must apply for new credit, provide an explanation why he needs 
it and wait for a decision. Depending on the situation, the process might take 
from several minutes to several days. In addition a rapid increase in a position 
adversely affects prices, so in their own interests arbitrageurs must accumulate 
positions slowly. 

For this class of policies, the main result is that the optimal policy is linear in 
the mispricing, and independent of the coefficient of risk aversion. The variance 
of the portfolio wealth does not grow with time. Thus, under this constraint 
the long-run risk can be expunged. 

Constraints of the second type are stronger: The arbitrageur cannot change 
leverage except by closing the position. The motivation is that the arbitrageur 
is often restricted in his ability to change the leverage - even if the need arises. 
Higher leverage is mostly needed when mispricing is increasing and the invest- 
ment account shows negative performance. Unfortunately, this is the worst time 
to ask for new credit because the creditors hate to invest in accounts with neg- 
ative performance. As Mark Twain said: "A banker is a fellow who lends you 
his umbrella when the sun is shining but wants it back the minute it rains." 

For policies in this class, the main result is that the long-run risk cannot be 
completely removed. Consequently, the arbitrageur will invest an amount that 
is inversely proportional to his risk aversion. 

These two examples suggest that what makes the convergence arbitrage risky 
in the long run is the inability of the arbitrageur to change the investment 
amount after the investment is committed. In particular, the results of the 
second example are consistent with the belief that arbitrageurs should be risk- 
neutral if they are to engage in convergence trading. 

The results of the present paper match closely with results of Grossman and Vila (1992'){ 
who study the dynamic investment under a constraint on investment amount. 
They find that the constraint essentially makes the investor behave as if he 
were more risk-averse than he actually is. Unlike in the present paper, however, 
the asset process is not mean-reverting in Grossman and Vila (1992) so the in- 
vestor could not hope to eliminate the risk completely. Also the constraint is 
not exogenous as in the present paper but a function of the investor's wealth. 
Because of these differences it is difficult to conclude whether the similarity of 
results is incidental or not. Both papers, however, support the view that certain 
constraints increase long-run riskiness of investment projects. 

use 



In a recent paper about convergence trading, Liu and Longstaff (2000) 



the Brownian bridge to model the mispricing process, an assumption on the 
process that requires a fixed horizon at which the mispricing will be effaced. By 
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the nature of their model, they cannot draw conclusions about long-run risks 
but they do find that arbitrageurs sometimes cannot eliminate all risk at the 
end of the arbitrage period. This result is consistent with results of the present 
paper. 

Closely related is the literature on optimal dynamic investments with risky 
assets. The seminal contributions to this literature were made in Samuelson (1969)) 



Merton (1969)1 [Merton (1971)| and [Merton (1973)| Optimal investment in as 



sets that follow a diffusion process with mean-reverting returns was analyzed in 
Kim and Omberg (1996)[ [Brennan et al. (1997]] [Campbell and Viceira (1999)| 



Barberis (2000)[|Wachter (2002)||Campbell et al. (2003)| The focus of the present 



paper is not on general risky investments but on the optimal extraction of profit 
from near arbitrage opportunities. Concequently, the paper comes to more def- 
inite conclusions by using a generalization of the Kelly investment criterion, 
which emphasizes the long-run behavior of portfolios and especially suitable to 
modelling objectives of large institutional traders. 

In addition, this paper computes the optimal leverage using a new method. 
While Kim and Omberg ingeniously solve dynamic programming equations by 
reducing them to a system of non-linear ordinary differential equations, Camp- 
bell and Viceira derive an approximate solution by linearization of Euler equa- 
tions, and Wachter uses the martingale method of Cox and Huang (1989) to 
separate consumption and financing decisions, the present paper derives the so- 
lution by methods of stochastic control, taking the advantage of the asymptotic 
investment criteria. 

The rest of the paper is organized as follows. Section 2 describes the model. 
Sections 3 and 4 derive the optimal differentiable and threshold policies and de- 
scribe their properties. Section 5 compares results for differentiable and thresh- 
old policies and concludes. 



2 Model 

An investor can invest in a mispriced asset whose mispricing is measured by 
X — \n{pi/p2)- Here pi and p2 are the asset's actual and "correct" prices, 
respectively. Mispricing follows the Ornstein-Uhlenbeck process: 

dxt = —axtdt + (jdzt, (1) 

where Xt is mispricing at time t, zt is a standard Wiener process, cr > and 
a > 0. Parameter a measures the speed of reversion to the correct price: The 
higher a is, the faster mispricing x drifts towards zero. Parameter a measures 
the size of new mispricing shocks introduced into the process. It is also useful 
to define S = a"^ /{2a), which is the variance of Xt in the long run. 

Changes in mispricing induce changes in the arbitrageur's wealth through 
his choice of the leverage coefficient f{x): The change in the logarithm of wealth 
is the product of the leverage coefficient and the change in the mispricing, 

du = ,f{x)dx. (2) 
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Intuitively, a 1% change in the mispricing results in a f{x)% change in the in- 
vestor's wealth. Later we will impose certain restriction on the available lever- 
age. 

The arbitrageur's utility is a linear combination of the growth rates in the 
expectation and variance of the portfolio wealth: 

[/ = liminfi[SK)-7Var(Mt)], (3) 

where parameter 7 measures risk aversion of the investor. The optimization 
problem is to choose the leverage function f{x) so that utility U is maximized. 

What is the meaning of this maximization criterion? If 7 is 0, then the 
criterion is the same as the criterion of maximizing the portfolio's long-run 
growth rate - the Kelly criterion. When 7 > 0, it introduces an additional 
term penalizing deviations from the expected growth rate. This additional term 
assures the investor that maximizing U protects him against the large deviations 
in the realized growth of his portfolio from the expectations. 

An example may perhaps add some insight into the investment criterion. 
Suppose that the wealth of the investor follows a geometric Brownian motion 
with constant parameters ^ and a. Then the utility of the investor is 

[/ = ^-7Cr2. 

This expression shows that the utility depends only on the parameters of the 
process and on risk aversion but not on the investment horizon. 

Another way to get an insight into this criterion is to compare it with the 
objective under the classical single period Markowitz model. In the Markowitz 
model the investor maximizes a linear combination of the expectation and the 
variance of the portfolio return. Therefore, the present model generalizes the 
Markowitz model to the dynamic setting by substituting the expectation and 
variance of the single period return with the asymptotic rates of increase in 
expectation and variance of the investor's wealth. 

An important assumption that we adopt in this generalization is that the 
investor is concerned only with long-run consequences of his policy. This as- 
sumption simplifies the analysis considerably and appears to be realistic for 
small investments by large institutions. In using this assumption we follow 
Bielecki and Pliska (1999) and Bielecki et al. (2000)[ who applied it to the anal- 



ysis of continuous investment policies in a similar situation. 

3 Optimal DifFerentiable Policy 

This section is about differentiable policies /(x), for which 



/ e (-00,-1-00), and (5) 
|/'(x)| ^ K. (6) 
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The policies from this class will be called D— policies. This class excludes poli- 
cies that prescribe extremely rapid growth of leverage with respect to mispric- 
ing. The following theorem is a cardinal ingredient in showing that optimal 
D— policies are linear. 

Theorem 3.1 Linear investment policies are the only Ti— policies such that the 
variance of the logarithm of wealth Ut is asymptotically constant. 

The proof uses a convenient representation for u: Let 

5(0-: f fiOdC. (7) 
Jo 

Then it is easy to check that 

2 



Ut ^ uq + g{xt) - g{xo) - ^ I f'{xr)dT. (8) 





The intuition behind this representation is simple. The investor can increase 
his wealth only if he increases his leverage when the mispricing increases. The 
derivative f'{x) measures the sensitivity of the leverage policy to mispricing, 
and JHJ) shows that the change in the logarithm of wealth equals a multiple of 
the integral of f'{x) plus a stationary process. The more sensitive the leverage 
policy is to mispricing, the greater the increase in the wealth induced by local 
variations in mispricing. The addition of g{xt) — gixo) reflects dependence of 
the wealth on the initial and final conditions. 

Proof of Theorem 13.11 Taking the variance of ut — wo in (jHJ gives 
Var(?/t-Mo) = Var {g{xt)ya^Cov (g{xt), f /'(x,)dT Var ( f f'{xr)dT 



/ ^ \J0 



_ (9) 

As t increases, all terms except possibly the third one tend to a finite limit. So, 
asymptotically. 



where 



Var(ut) ~ const + — rt, (10) 



r lim "'"'^o > 0. (11) 

t^oo t 



The rate r = if and only if Var(/'(a;)) = 0. Indeed, if Var(/'(x)) > then 



t rt 



r(t) = ivar(/'(x)) / / Corr(/'(:r,),/'(a;,))rfTds. (12) 



Jo 



According to Proposition lA.ll in Appendix IXl Corr(/'(a;r), /'(a;s)) > 0, so it 
follows from (|12|l that 

rit) ^ -Var(/'(x)) / Idr = Var(/'(x)) > 0. (13) 
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Finally, since Var(/'(x)) = if and only if f'{x) is almost surely constant, 
the investment policy must be linear if the variance of Ut is not to increase with 
time. QED. 

Theorem 13.11 shows that all linear strategies eliminate long-run risk. As a 
natural consequence, the next theorem predicates optimality of linear strategies 
with respect to the asymptotic investment criterion. The idea is to match any 
non-linear strategy with an admissible linear strategy that has higher expected 
return and lower growth in variance. The matching is possible exactly because 
all linear strategies have zero asymptotic growth in variance. 

Theorem 3.2 For the investor with asymptotic preferences, any D— policy is 
dominated by some linear D— policy. 

Proof: Let the non-linear policy be f{x). According to lO, in the long run 



Take the linear policy fL{x) = {E{f'{x)) — e)x with e > 0. For a certain e 
it is admissible. This is because \E{f'{x)) — e\ < K follows from |/'(x)| < K 
everywhere and < if on a set of positive measure, which are both true 

because / is a non-linear D— policy. The expectation of the logarithm of wealth 
under /l is 



It is clearly higher than the corresponding expectation for the non-linear policy. 
From Theorem l3.1l wc know that the variance of ut is asymptotically constant for 
linear policies and is asymptotically equivalent to rt, where r > 0, for non-linear 
policies. It follows that for sufficiently large T the linear policy Jl will have 
lower Var(wT) than the non-linear policy /. Thus /l asymptotically dominates 



It remains to find the optimal policy in the class of linear policies. It turns 
out that it is the policy that has the maximal sensitivity to mispricing. This is 
intuitively clear because every linear strategy eliminates the long-run risk, and 
the policy with the largest sensitivity to mispricing has the highest expected 
return. Formally, the following theorem holds. 

Theorem 3.3 The optimal linear G— policy is f{x) — ~Kx. 

Proof: it is easy to compute 



EM = uo- —E{f{x))t. 



(14) 



E{ut) = uo- —E{f'{x))t + —et. 



(15) 



/. QED. 



lim 

t — >oo 



>o t 

Var(wf) 



E{ut) 




lim 



= 0, 



t 



u 
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Thus, the utility is maximized by the maximal possible k, from which the the- 
orem follows. QED. 

The theorem implies that the arbitrageur should increase the leverage at the 

maximal possible rate. In particular, the optimal strategy docs not depend on 
the risk aversion parameter or properties of the mispricing process. The intuitive 
meaning of this conclusion is that the appropriate use of leverage allows the 
arbitrageur to eliminate all the long run risk. As the next section shows, this 
conclusion will be reversed if the arbitrageur is more constrained in the use of 
leverage. 



4 Optimal Threshold Policy 

When an arbitrageur uses threshold policies he keeps his finger on a button 

that triggers investment while looking at the computer monitor and waiting for 
a mispricing. If he observes a mispricing that exceeds a threshold, S, he pushes 
the button and a fixed amount, L, is directed to this opportunity. When the 
mispricing falls below another threshold, ,s, he pushes another button and the 
position closes. Leverage L never changes when the position is opened. Simple 
threshold policies have equal thresholds: S = s. 

As was said in the Introduction, arbitrageurs use threshold policies because 
they often cannot secure additional funds for positions they already opened. 
They also favor threshold policies because these policies allow economizing on 
transaction costs. 

General threshold policies are complicated to analyze. Fortunately, the fol- 
lowing theorem shows that it is suSicient to study simple threshold policies. 

Theorem 4.1 Any threshold policy is dominated by a simple threshold policy. 

This theorem is given without proof. Intuitively, for the Markov process of 
mispricing the optimal investment policy should not depend on the history of 
investing, and the only threshold policies that pass this selection test arc simple 
threshold policies. Indeed, if s < S, and the mispricing is between s and S, then 
the position is open if the mispricing has fallen from above S but not yet gone 
below .s, and it is closed if the mispricing has risen from below s but not yet 
gone above S. It follows that investment under the threshold rule with s S 
depends on history of investment and therefore cannot be optimal. 

The relevant properties of the simple threshold policies are described in the 
next theorem, which uses the following notation: 



1 2 1 



1 fs' ^ 



(17) 



For S = 0, the value of tp{S) can be computed explicitly: V(0) = 2 In 2/(av'27rI]). 
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[Figure ^ here] 



Figure 1: Plots of Optimal Leverage L as Function of Threshold S 



[Figure [51 here] 



Figure 2: Plots of Optimal Utility U as Function of Threshold S 



Theorem 4.2 For the simple threshold policy with threshold S and leverage L 
lim ^("*-"°) ^c,{L,S)^a^mS) 

t—*oo t 

- ^^°) = C2{L, S) ^ {a^mS) fi^{S). 

t — ^oo t 

The investor's utility is U{L, S) = ci(L, S) — "fC2{L, S). 

The proof is relegated to Appendix IbI 

The first step in obtaining the optimal policy from this theorem is to calcu- 
late reduced utility function that depends only on threshold S : 

Corollary 1 For a fixed threshold S the optimal leverage is 



^ ' A-iaY.(l){S)i^{S) 
and the corresponding utility is 

The functions L{S) and U{S) are illustrated in Figures ^ and El We can 
see that higher long-run variance S leads to an increase in both leverage L and 
utility U. Higher persistence of the process does not change optimal leverage 
but decreases utility. 

The function '0(5') is increasing in 5^, and consequently the maximal utility 
is reached at S* = 0. The optimal threshold policy is given in the following 
theorem. 

Theorem 4.3 Utility is maximized for S = and L = n/{A'y In 2). The optimal 
utility isU ^ a\/27rE/(87 In 2). 

Predictably, the utility is higher when the convergence is fast (a is high) 
and the arbitrage opportunity is large (S is high). Not so predictable is that 
the optimal leverage does not depend on the parameters of the process: This 
leverage optimally balances risk and return for every Ornstein-Uhlenbeck pro- 
cess. What is most important, however, is that the optimal leverage depends 
on the parameter of the risk aversion 7. The higher 7 is, the lower the amount 
is that the arbitrageur is willing to commit to the arbitrage opportunity: The 
arbitrageur that uses only threshold strategies is unable to remove the long-run 
risk and must adjust his behavior. 
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[Tabic □ here] 



Table 1 : Summary Statistics of Mispricing Factor 



[Table H here] 



Table 2: Results of Estimation of Mispricing Factor Process 

5 Empirical Application 

This section studies convergent trading in the context of WEBS, which are 
shares of open-end mutual funds that replicate the price performance of foreign 
stock market indexes. WEBS trade on a stock exchange like ordinary stock, and 
their managers try to keep the fund price close to the net asset value (NAV) of 
their underlying stocks. They buy back shares if the price is less then NAV and 
issue additional shares if the price is greater than NAV. 

As in previous sections, I assume that the investor can hedge the risk of the 
underlying portfolio. Indeed, trading index futures provides a very good hedge 
of country exposure. Absense of the perfect hedge limits the implications of my 
analysis. 

I use price and NAV daily data for WEBS that track indices of Australia, 
Austria, Belgium, Canada, France, Germany, Hong Kong, Italy, Japan, Malaysia, 
Mexico, Netherlands, Singapore, Spain, Sweden, Switzerland, and the United 
Kingdom. This is a total of 17 countries. The data start in March of 1996 and 
end in August of 2000, which gives around 1000 datapoints for each country. 

The mispricing factor xt is computed as the logarithm of the ratio of the 
price to NAV. Some summary statistics for xt are given in Tabled I model the 
dynamic of the mispricing factor as an AR(1) process, which is the discrete-time 
counterpart to the Ornstein-Uhlenbeck process: 

Xt = Pxt-i + (jet, (18) 

The results of estimation of this process are summarized in Table [21 They 
show that (3 is around 0.5, and a is around 0.01. Durbin- Watson statistic shows 
that the AR(1) process is a reasonably good approximation to the true mispricng 
process. 

Our first goal is to get an estimate of the order of the coefficient k in the 
optimal linear strategy. Let us use the following estimates of the order of pa- 
rameters: ^ lO""*, a ^ 0.5. From Theorem l3.3l the long-run variance of the 
logarithm of wealth is 5 • 10~^fc^, and the expected change in the logarithm of 
wealth is 5 • 10~^fc per day. The average daily change in the logarithm of the 
S&P500 index has been 8 • 10"-^ - 10"^ -^^ ^^le last five years. To get this return 
by convergent trading, the sensitivity k of the linear strategy would have to be 
set at 20. The corresponding asymptotic variance of the logarithm of wealth 
would then be 2 • 10^^. This is considerably smaller than the variance of the 
deviation of the logarithm of the S'&P500 index from its linear trend, which for 
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[Figure O here] 

Figure 3: Contour Graph of Mean of Average Daily Returns 



[Figure 0] here] 



Figure 4: Contour Graph of Standard Deviation of Average Daily Returns 

the 5-year period from August 1995 to August 2000 can be estimated at about 
3-10-3. 

These computations suggest that this market could not be efficient if trans- 
actions costs were absent. The goal of the following Monte-Carlo simulations is 
to analyze properties of threshold strategies and to find the optimal threshold 
strategy in situations with transaction costs. I assume that the transaction cost 
c is 0.25%, and that the true process of discounts is AR(1) with /? = 0.3 and 
a = 10-2. 

The simulations were organized as follows. One hundred realizations of the 
mispricing factor process were generated. Each realization had 1250 datapoints. 
For each realization I simulated the process of investing according to a strategy 
from a finites set of threshold strategies. Thus, each pair of a strategy and a 
realization of the mispricing factor process resulted in a realization of wealth 
process. 

For each realization of wealth process, I calculated the average daily return 
as a difference between logarithm of final wealth and logarithm of initial wealth 
divided by the length of the realization. After that, I calculated the mean and 
the variance of this statistic over all realizations of the wealth process that 
corresponded to a particular strategy. Thus, as a final product I had a function 
that mapped each strategy into the expectation and variance of the average 
daily return. 

I used the following set of strategies. A rise in the mispricing over a threshold 
S triggers the opening of the position. When the mispicing returns to the region 
below s £ [0, S], the arbitrageur closes his position and waits for a new trigger 
signal. The threshold 5 was chosen in the range from 0.5% to 2%. The threshold 
s was chosen in the range from S* -I- c to 0%. 

Figure |31 is a contour graph of the mean of the average daily return to a 
strategy. On the vertical axis of the graph is the low threshold S and on the 
horizontal axis is the difference between high and low thresholds S — s. They are 
denominated in percentage terms. The lines on the contour graph correspond 
to the strategies that have the same mean of average daily return. 

This graph suggests that the average return is maximized for s = and S 



[Figure O here] 
Figure 5: Contour Graph of Sharpe Ratio 
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set to some So > c. Therefore, if the investor wants to maximize average return 
he should invest only if the mispricing exceeds the transaction cost by some 
markup. 

Figure ^ is a contour graph of the standard deviation of the average daily 
returns. We can conclude from it that the variance is increasing with an increase 
in the absolute value of the low threshold and with an increase in the difference 
between the thresholds. 

Figure |31 is a countour graph of the ratio of the mean of average daily return 
to its standard deviation. It suggests that the ratio is maximized for the strategy 
that sets s = and S = c. Thus an investor who uses only threshold strategies 
and who wants to maximize the Sharpe ratio should invest immediately when 
the mispricing exceeds his transaction costs. 

6 Discussion 

Section 131 shows that in the class of differentiable policies with bounded deriva- 
tive the optimal policy is linear in the mispricing and the coefficient in the linear 
relationship is the highest possible. The optimal strategy in this case does not 
depend on the risk-aversion of the arbitrageur, and all the long-run risk can be 
eliminated. 

In contrast, according to the results of Sectional if only threshold policies are 
available then the long-run risk is unavoidable and the investment is inversely 
proportional to risk aversion. This conclusion is consistent with the belief that 
arbitrageurs are typically risk-neutral. The suggested reason for this belief is 
that the constraints on flexibility of changes in leverage make the convergence 
trading risky even in the long run. 
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A Auxiliary Statistical Result 

Suppose that x and y are jointly Gaussian random variables, Var(a;) = Var(j/) = 
1, and Cov(a;, y) = f3. 

Proposition A.l If f eB and Ya,r{f{x)) ^ 1, then Cov{f{x)J{y)) £ [0,/?]. 

Proof: Assume without loss of generality that Ef{x) — 0. Since Hermite 
polynomials are complete in the class of D— policies, we can use them to ap- 
proximate /. Then the assertion of Proposition lA. II follows from Proposition 

roi 

Proposition A. 2 /// is a polynomial of degree N, Ef{x) = andVar{f(x)) = 
1, then Cov(/(x), /(y)) £ [f3^ , (]]. The maximum and minimum are achieved 
fi^) — ^ fl'^'^ fi^) — Hn{x), respectively, where Hn{x) is the Hermite 
polynomial of degree N. 

Proof: Represent f{x) as a sum of Hermite polynomials: 

N 

f{x)=Y,akHk{x), (19) 
1 

where by definition 

Hk{x) = exp 



k\ {dxY 



exp 



(20) 



Hermite polynomials form an orthonormal system with respect to the Gaussian 
kernel and possess the following useful property: 

CoY{H,{x),Hj{y))=P'5,,. (21) 

Using this property and orthonormality, we can write 

JV 



Cov(/(x),/(2/))=^a^/3'= (22) 
1 

Var(/(x))=5^ai (23) 



1 



From (|22|l and ^2[\\ . the maximum of Cov(/(x), f{y)) is (3 and it is achieved by 
f{x) — Hi{x) = X. The minimum is (3^ and it is achieved by f{x) = Hn{x). 
QED. 
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B Proof of Theorem 14.21 



Proof: By definition, the threshold pohcy is 



m-r;''''^^^' (24) 

II a; < 5. 



The generahzed Ito formula gives 



ut ^ uq + g{xt) - g{xo) + y"^^ I ^s{xT)dT, (25) 







where ^5 is the Dirac delta-function and 



f nOdC- (26) 
Jo 

The intuition behind this representation is simple: The investor increases his 
wealth only when he triggers the policy. The number of times the policy is 
triggered is stochastic and measured by the integral of the delta function. The 
profit earned at each occasion is proportional to the product of local volatility 
cr^ and leverage L. Finally, there is a dependence of wealth on initial and final 
conditions which is captured by g{xt) — 5(2^0) ■ 

Since g{xt) does not grow with time, the arbitrageur's utility depends only 
on the moments of the integral of the delta function: 

t 

5s{xr)dT. (27) 

The first step in the computation of the moments is calculating the expecta- 
tion and covariance function of the generalized stochastic process 5t.s —'■ Ss{xt)- 
The joint density of xt-^ and xt^ is 

pix,,x,)^ — / ( } '^(;)V7^i')|, (28) 

^ ' 2^2^/1 - a(T)2 ^\ 2^\X2) \a{T) 1 ) ^^2/ J ' ^ ' 

where r = t2 — ti and a(T) = e^"'"^' . The delta function can be approximated by 
■^X[s S-i-A]i where xa denotes the characteristic function of set A and A limits 
to 0. Then, computing two first moments for X[s s+A]i^t) and taking the limit 
A — » give the following formulas: 
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For example, equality (|30|1 can be seen from the following calculation: 



-j^ nOO f-OO 



1 "5 2 



2S(1 -a(r)2) 



1 / ^2 1 

: exp 



27rEv'l - a(r)2 V ^ 1 + a(^) 

(31) 

From ijSni and (jSOJ it follows that 

CoY{dt,,s,0t2,s) = — — ^===cxp -— , . - :7^cxp - — 
27ri;^l - a(T)2 V Sl + a(r)/ 27ri; V S 

(32) 

Since Cov((5tj^ iSfj^g) depends only on r = i2 ^ti, it can be denoted by '!?(t, S"). 
Next, 

6t,sdt\=2j^ dtij^ CoviSt„s,St„s)dt2 

r-T pT-ti 

(substituting r = t2 - ti) = 2 / dti ■d{T,S)dT i^^) 

Jo Jo 

(changing order of integration) = 2 / (T — T)'d{T, S)dT. 

Jo 

Since i?(t, S) — 0{e^'^'^) for a positive c and r ^ oo, and i?(t, S) is integrable 
around t = 0, it follows 

r 

/ i?(t, S')dT const and 
Jo 

/•^ 

/ ti9(t, 5)(iT — + const 
Jo 

as T ^ oo . 

So, for large T 

Var^^ dt,s dtj 2T i?(t, S')dT 

= 2T^expf-^) r f . ^ 
(substituting t = — a^"'^ 

= 0(^)2^(5)T. 



(34) 
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This implies all the assertions of the theorem. QED. 
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